Pronoun Resolution and Summary Extraction From English Documents

نویسندگان

  • Naman Kumar Gupta
  • Saurabh Garg
چکیده

This paper presents an approach to generate a precise and meaningful summary of a Document in English Language. Here, we adopt a modified Sidner’s Focusing algorithm to perform pronoun resolution. We devise an algorithm to divide any sort of compound and complex sentences into simple sentences. Anaphora resolution is performed on these simple sentences. Then we find the lexical cohesion between pairs of the Noun Phrases whose antecedents have already been found.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Iranian EFL learners’ knowledge of overt pronoun constraint in English

The present study aims to investigated Iranian L2 speaker`s knowledge of the overt pronounconstraint (OPC) in English. It also aims to examine L2 learners understanding of UniversalGrammar (Chomsky 1981, 1986, 2000, 2001) and if it is common to all languages. Specifically,this study takes a new look at the L2 acquisition of knowledge of the overt pronoun constraint(Montalbetti 1984) by Persian ...

متن کامل

Multi-Lingual Coreference Resolution With Syntactic Features

In this paper, we study the impact of a group of features extracted automatically from machine-generated parse trees on coreference resolution. One focus is on designing syntactic features using the binding theory as the guideline to improve pronoun resolution, although linguistic phenomenon such as apposition is also modeled. These features are applied to the Arabic, Chinese and English corefe...

متن کامل

Effect of Pronoun Resolution on Document Similarity

This paper presents a novel effect of Pronoun Resolution on measurement of document similarity. In this paper we have studied the effect of pronoun resolution within the framework of the Vector Space Model and Probabilistic Latent Semantic Analysis. For this purpose we have developed a Benchmark Corpus consisting of documents whose similarity scores have been given by human beings. We measured ...

متن کامل

The Referential Versus Non-referential Use of the Neuter Pronoun in Dutch and English

This paper discusses a corpus-based investigation of the distribution of the thirdperson neuter singular pronoun in Dutch (“het”). We labeled all pronominal occurrences of “het” in a large corpus of documents. On the basis of the annotated corpora, we developed an automatic classification system using machine learning techniques to distinguish between the different uses of the neuter pronoun. A...

متن کامل

Similarity-based Multilingual Multi-Document Summarization

We present a new approach for summarizing clusters of documents on the same event, some of which are machine translations of foreign-language documents and some of which are English. Our approach to multilingual multi-document summarization uses text similarity to choose sentences from English documents based on the content of the machine translated documents. A manual evaluation shows that 68%...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009